Enhancing the Performance of Multigrid Smoothers in Simultaneous Multithreading Architectures
نویسندگان
چکیده
We have addressed in this paper the implementation of redblack multigrid smoothers on high-end microprocessors. Most of the previous work about this topic has been focused on cache memory issues due to its tremendous impact on performance. In this paper, we have extended these studies taking Simultaneous Multithreading (SMT ) into account. With the introduction of SMT, new possibilities arise, which makes highly advisable a revision of the different alternatives. A new strategy is proposed that focused on inter-thread sharing to tolerate the increasing penalties caused by memory accesses. Performance results on an IBM’s Power5 based system reveal that our alternative scheme can compete with and even improve sophisticated schemes based on tailored loop fusion and tiling transformations aimed at improving temporal locality.
منابع مشابه
Towards a complete FEM-based simulation toolkit on GPUs: Unstructured Grid Finite Element Geometric Multigrid solvers with strong smoothers based on Sparse Approximate Inverses
We describe our FE-gMG solver, a finite element geometric multigrid approach for problems relying on unstructured grids. We augment our GPUand multicore-oriented implementation technique based on cascades of sparse matrix-vector multiplication by applying strong smoothers. In particular, we employ Sparse Approximate Inverse (SPAI) and Stabilised Approximate Inverse (SAINV) techniques. We focus ...
متن کاملBlock-asynchronous Multigrid Smoothers for GPU-accelerated Systems
This paper explores the need for asynchronous iteration algorithms as smoothers in multigrid methods. The hardware target for the new algorithms is top-of-the-line, highly parallel hybrid architectures – multicore-based systems enhanced with GPGPUs. These architectures are the most likely candidates for future highend supercomputers. To pave the road for their efficient use, we must resolve cha...
متن کاملTowards a complete FEM-based simulation toolkit on GPUs: Geometric Multigrid solvers
We describe a GPUand multicore-oriented implementation technique for a key component of finite element based simulation toolkits for partial differential equations on unstructured grids: Geometric Multigrid solvers. We use efficient sparse matrix-vector multiplications throughout the solver pipeline: within the coarse-grid solver, smoothers and even grid transfers. Our implementation can handle...
متن کاملClassification and performance evaluation of simultaneous multithreaded architectures
In this paper we classify simultaneous multithreaded architectures based on how they select instructions issued in a single cycle. This classiication allows us to study the present trend of technology as well as to explore the new avenues for improvements in simultaneous multithreaded architectures. Based on our classi-cation, we study the impact of various parameters of simultaneous multithrea...
متن کاملAlgebraic Tailoring of Discontinuous Galerkin p-Multigrid for Convection
This work presents an element-local algebraic approach to constructing coarse spaces for p-multigrid solvers and preconditioners of high-order discontinuous Galerkin discretizations. The target class of problems is convective systems on unstructured meshes, a class for which traditional p-multigrid typically fails to reach textbook multigrid efficiency due to a mismatch between smoothers and co...
متن کامل